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Abstract 

We study the problem of optimizing nonlinear objective functions over matroids presented by or- 
acles or explicitly. Such functions can be interpreted as the balancing of multi-criteria optimization. 
We provide a combinatorial polynomial time algorithm for arbitrary oracle-presented matroids, that 
makes repeated use of matroid intersection, and an algebraic algorithm for vectorial matroids. 

Our work is partly motivated by applications to minimum-aberration model-fitting in experi- 
mental design in statistics, which we discuss and demonstrate in detail. 



1 Introduction 

In this article, partly motivated by applications to minimum-aberration model-fitting in experimental 
design, which will be discussed briefly at the end of this introduction and in detail in §5, we study the 
problem of optimizing an arbitrary nonlinear function over a matroid, set as follows. 

Nonlinear Matroid Optimization. Given matroid M on ground set := {1, . . . , n}, integer weight 
vectors Wi = . . . ,Wi^n) G for i = 1,... ,d, and function / : M'^ ^ M, find a matroid base 

B G B{M) C 2^ minimizing the "balancing" by / of the d weights Wi{B) := YljeB base B, 

f{wi{B),...,WdiB)) = f l'^wij,...,^Wd,j 

Vies jeB 



(All necessary basics about matroid theory are provided in H'2.1[ For more details consult [H] or [17j.) 

Nonlinear matroid optimization can be interpreted as multi- criteria matroid optimization: the d 
given weight vectors wi, . . . ,Wd represent d different criteria, where the value of base B G B{M) under 
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criterion i is its i-th weight Wi{B) = "^j^b where the objective is to minimize the "balancing" 

f{wi{B), . . . , Wd{B)) of the d given criteria by the given function /. 

In fact, we have a hierarchy of problems of increasing generality, parameterized by the number d 
of weight vectors. At the bottom lies standard linear matroid optimization, recovered with d = 1 and 
/ the identity on M. At the top lies the problem of maximizing an arbitrary function over the set of 
bases, with d = n and Wi = Ij the i-th standard unit vector in M" for all i, see Proposition 12.41 below. 

It will be often convenient to collect the weight vectors in a d x n matrix W. Thus, the i-th row 
of this matrix is the i-th weight vector Wi = . . . ,Wi^n)- For each subset S N we define its 

W-profile to be 



which is the vector giving the value of S under each of the d weight vectors. The nonlinear matroid 
optimization then asks for a matroid base B S B{M) minimizing the objective function f{W{B)). 

The computational complexity of nonlinear matroid optimization depends on the number d of 
weight vectors, on the representation of weights (binary {wi,j) versus unary lifjjl, see §2.1|) . on the 
type of function / and its presentation, and on the type of matroid and its presentation. We will be 
able to handle an arbitrary function / presented by a comparison oracle that, queried on x, y G Z*^, 
asserts whether or not f{x) < f{y), and an arbitrary matroid presented by an independence oracle, 
that, queried on / C A^, asserts whether or not / is independent in M, that is, whether I Q B for 
some base B G B{M), see §2.11 These are very broad presentations that reveal little information (per 
query) on the function and matroid, making our problem setting rather expressive but difficult for 
achieving strong complexity results. 

Standard linear matroid optimization is well known to be efficiently solvable by the greedy al- 
gorithm. Nonlinear matroid optimization with d fixed and / concave turns out to be solvable in 
polynomial time as well [HI [11]. In fact, using sophisticated geometric methods, the nonlinear opti- 
mization problem with d fixed and / concave has been recently shown to be efficiently solvable for 
bipartite matching [4] and for broader classes of combinatorial optimization problems [12] . Therein, 
maximization rather than minimization form is used, and hence convex rather than concave functions 
are considered. 

However, generally, nonlinear matroid optimization is intractable, even for uniform matroids. In 
particular, if d is variable then exponential time is needed even for {0, l}-valued weights, and if the 
weights are presented in binary, then exponential time is needed even for fixed dimension d = 1. See 
Propositions 12.3 1 1^-41 and 12.51 in the sequel for various intractability statements. 
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In spite of these difficulties, here we estabhsh the efficient solvabihty of the problem as follows. 



Theorem 1.1 For every fixed d and p, there is an algorithm that, given a matroid M presented 
by an independence oracle on the n-element ground set N, integers ai,...,ap G Z, weight vectors 
wi,...,Wd € {oi, . . . , ap}", and function / : M'^ ^ M presented by a comparison oracle, solves the 
nonlinear matroid optimization problem in time that is polynomial in n and max(aj). 

We also state the following natural immediate corollary concerning {0, 1, . . . ,p}-valued weights. 

Corollary 1.2 For every fixed d and p, there is an algorithm that, given n-element matroid M pre- 
sented by an independence oracle, wi, . . . ,Wd € {0, 1, . . . ,p}"', and function f : M*^ M presented by 
a comparison oracle, solves the nonlinear matroid optimization problem in time polynomial in n. 

The algorithm establishing Theorem 1 1.1 1 is combinatorial and makes repeated use of matroid inter- 
section (see e.g. [9] or [10], and see [7] for another recent interesting application of matroid intersection). 
In fact, it invokes the matroid intersection algorithm roughly n^'' times, and hence it is quite heavy. 
However, most matroids appearing in practice, including graphic matroids and those arising in the 
applications to experimental design to be discussed later, are vectorial. Therefore, we also provide an- 
other, more efficient, linear-algebraic algorithm for vectorial matroids. Moreover, this algorithm applies 
to weights with unlimited number (rather than fixed number p) of different values Wij of entries. 

Theorem 1.3 For every fixed d, there is an algorithm that, given integer m x n matrix A, weight vec- 
tors wi, . . . ,Wd G Z", and function / : M*^ ^ M presented by a comparison oracle, solves the nonlinear 
optimization problem over the vectorial matroid of A in time polynomial in (A) and max\wij\. 

A specific application that can be solved by either the combinatorial algorithm underlying Theorem 
11.11 or the more efficient linear-algebraic algorithm underlying Theorem 11.31 is the following. 

Example 1.4 (minimum-norm spanning tree). Fix any positive integer d. Let G be any connected 
graph with edge set E = {ei, . . . , e^}, and let wi, . . . , € Z"- be weight vectors with Wij representing 
the values of edge ej under the i-th criterion. Let A be the vertex-arc incidence matrix of an arbitrary 
orientation of G. Then the vectorial matroid of A is the graphic matroid of G whose bases are the 
spanning trees of G. Now fix also any q that is either a positive integer or oo, and let / : ^ M 

d - rl 

be the Iq norm given by \\x\\q := (X^j=i l^il'')'? for finite q and ||a;||oo := max^^j^ \xi\. Note that a 
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comparison oracle for / = is easily and efficiently realizable. Then Theorems 11.11 and 11.31 assure 
that a spanning tree T of G minimizing the Iq norm of the multi-criteria vector, given by 

\\{w,{T),...,w,{Tm^ = \\W{T)\\^ 

is computable in time polynomial in n and max|it;jj|. Note that if P^^NP then the problem is not 
solvable in time polynomial in the binary length (wij) even for the graph obtained from a path by 
replacing every edge by two parallel copies, see Proposition 12.51 below. 

Experimental Design. We conclude the introduction with a brief discussion of the application to 
experimental design, elaborated in detail in §5. The general framework is as follows. We are interested 
in learning an unknown system whose output y is an unknown function <I> of a multivariate input 
X = {xi, . . . , Xfc) € M'^'. It is customary to call the input variables Xi factors of the system. In order to 
learn the system, we perform several experiments. Each experiment i is determined by a point pi = 
{pi^i, . . . ,Pi,k) and consists of feeding the system with input x := pi and measuring the corresponding 
output Hi = Based on these experiments, we wish to fit a model for the system, namely, determine 

an estimation $ of the function $, that satisfies the following properties: 

• It lies in a prescribed class of functions; 

• It is consistent with the outcomes of the experiments; 

• It minimizes the aberration - a suitable criterion - among models in the class. 

More detailed discussion and precise definitions will be given in §5. We have the following broad corol- 
lary of Theorems 11.11 and ll. 31 see §5 for the precise statement and its various practical specializations 
to concrete aberration criteria useful in optimal model-fitting in experimental design. 

Corollary 1.5 An aberration-minimum multivariate-polynomial model is polynomial time computable. 

The article proceeds as follows. In §2 we set some notation and preliminaries, make some prepa- 
rations for the algorithms in following sections, and demonstrate various intractability limitations on 
nonlinear matroid optimization. In §3 we discuss arbitrary matroids presented by oracles, and provide 
the combinatorial algorithm for nonlinear matroid optimization, thereby establishing Theorem ll.il In 
§4 we provide the more efficient algebraic algorithm for nonlinear optimization over vectorial matroids, 
thereby proving Theorem 11.31 We conclude in §5 with a detailed discussion of the experimental design 
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application and prove Corollary 1 1 . 5 1 and its various practical specializations. Readers interested mostly 
in experimental design can go directly to §5, where the minimum-aberration model-fitting problem is 
reduced to nonlinear optimization over a suitable matroid, and where each of the algorithms developed 
in §3-4 can be invoked black box. 

2 Preliminaries, Preparation and Limitations 
2.1 Preliminaries 

We use M for the reals, Z for the integers, and N for the nonnegative integers. The i-th standard unit 
vector in is denoted by Ij. The support of x G M" is the index set supp(x) := {j : Xj ^ 0} of 
nonzero entries of x. The integer lattice Z" is naturally embedded in R**. Vectors will be interpreted 
as either row or column vectors interchangeably - this will be relevant only when such vectors are 
multiplied by matrices - in which case the correct interpretation will be clear from the context. We 
often collect a sequence of vectors designated by a low case letter as the rows of a matrix designated 
by the corresponding upper case letter. Thus, our weight vectors Wi = {wi^i, . . . , Wi^n), i = 1, ■ ■ ■ , d are 
arranged in a d x n matrix W, and our design points pi = {pi,i, ■ ■ ■ ,Pi,k), i = 1, . . . , are arranged in 
an m X /c matrix P. The space R" is endowed with the standard inner product which, for w,x & M", is 
w ■ X := 'Yl^=i WiXi. Vectors w in are also regarded as linear functions on M" via the inner product 
w ■ x. So we refer to elements of as points, vectors, or linear functions, as is appropriate from the 
context. We write 1 for the vector with all entries equal to 1, of dimension clear from the context. 

Our algorithms are applied to rational data only, and the time complexity is as in the standard 
Turing machine model, see e.g. [H [6l [16]. The input typically consists of rational (usually integer) 
numbers, vectors, matrices, and finite sets of such objects. The binary length of an integer number 
z G Z is defined to be the number of bits in its binary representation, {z) := 1 -|- [log2(|2| -|- 1)] (with 
the extra bit for the sign). The length of a rational number presented as a fraction r = ^ with p,q ^"L 
is (r) '■= {p) + {q)- The length of an m x n matrix A (or a vector) is the sum {A) := j{cii.j) of the 
lengths of its entries. Note that the length of A is no smaller than the number of entries, {A) > mn. 
Therefore, when A is, say, part of an input to an algorithm, with m, n variable, the length {A) already 
incorporates mn, and so we will typically not account additionally for m, n directly. But sometimes 
we will also emphasize n as part of the input length. Similarly, the length of a finite set S of numbers, 
vectors or matrices is the sum of lengths of its elements and hence, since {S) > \S\, automatically 
accounts for its cardinality. Some input numbers affect the running time of some algorithms through 
their unary presentation, resulting in so-called "pseudo-polynomial" running time. The unary length 
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of an integer number z G Z is the number |2:| + 1 of bits in its unary representation (again, an extra 
bit for the sign). The unary length of a rational number, vector, matrix, or finite set of such objects 
are defined again as the sums of lengths of their numerical constituents, and is again no smaller than 
the number of such numerical constituents. Often part of the input is presented by oracles. Then the 
running time counts also the number of oracle queries. An oracle algorithm is polynomial time if its 
running time, including the number of oracle queries, is polynomial in the length of the input. 

Next, we make some basic definitions concerning matroids and set some associated notation. For 
matroid theory see [H] or [l7j. A matroid M is described by giving a finite ground set £{M) and a 
nonempty set B{M) of subsets of £{M) called the set of bases of M, such that for every B, B' G B{M) 
and i e B\B' there is an i' G B' such that B\{i}[J {i'} G B{M). A subset / of £{M) is independent 
in M if / C S for some base B G B{M). The set of independent sets of M is denoted by Z[M). Often 
it is convenient to let £{M) = N = {1,2, . . . ,n} for some positive integer n. The rank rank(S') of a 
subset S C f (M) is the maximum cardinality [/[ of an / G I{M) that is contained in S. The rank of 
the matroid is rank(M) := rank(iS(M)) and is equal to the cardinality \B\ of every base B G B{M). 
An independence oracle for M is one that, queried on / C f (M), asserts whether / is in X(M). An 
independence oracle allows us to compute the rank of every S C f (M) as follows. Start with I := 0. 
For each j G 5 do (in any order): if / U {j} is in T{M) then set I := I \ {j}. Output rank(S') := |/|. 

An important example of a matroid is the graphic matroid M{G) of a graph G with £{M[G)) 
being the edge set of G and B{M{G)) equal to the set of edge sets of spanning forests of G. A 
further key example is the vectorial matroid of an m x n matrix A (over a field F) with B{M) the 
set of subsets of indices of maximal linearly-independent subsets of columns of A. Throughout, when 
treating complexity issues, we assume that A has integer components, and that we do arithmetic 
over the rationals. A uniform matroid is any matroid that is isomorphic to the matroid hlm,n having 
ground set N = {l,...,n} and having all m-subsets of as bases. If Mi and M2 are matroids 
with disjoint ground sets, then their direct sum Mi © M2 has £{Mi © M2) := £{Mi) 1+) £{M2) and 
B{Mi ffi M2) ■■= {Bi \S B2 : Bi £ B{Mi),B2 G B{M2)}. A partition matroid is any matroid that is a 
direct sum of uniform matroids (Bl^iUmi,ni (with the ground sets of the Umi,rn labeled to be pairwise 
disjoint). If rrii = m and all Ui = k for all i, then we write ^ for this r-fold sum of Um,k- 

It is well known that all graphic matroids and uniform matroids are vectorial. Furthermore, any 
partition matroid that is the direct sum (Bl^^Ui^m of rank-1 uniform matroids is graphic: it is the 
matroid of the graph obtained from any forest on r edges by replacing the i-th edge by Ui parallel 
copies for i = 1, . . . , r. In particular, W^^is a, graphic matroid for any positive k and r. 
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2.2 Preparation 



Here we provide preparatory ingredients which will be used in algorithms in §3-4. We begin by showing 
that finding an optimal base for a nonlinear matroid optimization problem can be reduced to finding 
the optimal objective function value of a small number of subproblems. Consider data for a nonlinear 
matroid optimization problem, consisting of a matroid M, weight vectors wi, . . . ,W(i E 7,'^, and function 
/ : M*^ ^ R. Each subset S C N gives a subproblem of nonlinear matroid optimization as follows. The 
matroid of the subproblem is the restriction of M to 5, that is, the matroid M.S on ground set S in 
which a subset I C S is independent if and only if it is independent in M. Note that an independence 
oracle for the restriction matroid M.S is realizable at once from that of M. The weight vectors of the 
subproblem are the restrictions of the original weight vectors to S. The function / : M'^ ^ M in the 
subproblem is the same as in the original problem. We have the following useful statement. 

Lemma 2.1 The nonlinear matroid optimization problem of finding an optimal base of an n-element 
matroid is reducible in time polynomial inn to finding the optimal objective value o/n+l subproblems. 

Proof. Denote by f*{S) the optimal objective function value of the subproblem on S C N. Now 
compute the optimal objective function value for n + 1 such subproblems as follows. 

Start with S := N; 
Compute m := rank(S'); 

Compute the optimal value /* := f*{N) of the original problem; 

for j=l,2,. . . ,n do 

Compute rank(S' \ {i}); 

Compute the optimal subproblem value f*{S \ {j}); 
if rank(5 \ {j}) = m and f*{S \ {j}) = f* then set S := S \ {j}; 
end 

return B := S; 

It is not hard to verify that the set B obtained is indeed an optimal base for the original problem. □ 

We record for later the following statement, which follows directly from the definitions. 

Proposition 2.2 Consider n-element matroid M, weights w\, . . . ,Wd E , and / : M'' ^ M. Put 

U := {W{B) : B G B{M)} = {u G Z'^ : u = W{B) for some base B G 13{M)} . (1) 
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Then the optimal objective value of the corresponding nonlinear matroid optimization problem satisfies 



f* = min/(n) . 

Thus, the problem of computing /* reduces to that of constructing the set U of M^-profiles of bases. 
This really is the crucial component of the solution of the nonlinear matroid optimization problem. 
While the cardinality of C/ = {W{B) : B E B{M)} may be polynomial under suitable assumptions on 
the data, its direct computation by computing W{B) for every base is prohibitive since the number of 
matroid bases is typically exponential in n. Instead, we will construct a finite superset Z of "potential" 
VF-profiles of bases, satisfying [/ C Z C Z'^, and then filter U out of Z. However, this is not an easy 
task either: deciding if a given u G Z"^ satisfies u = W{B) for some base B is NP-complete already 
for fixed d = 1 and uniform matroids or partition matroids that are the direct sums of rank-1 uniform 
matroids, see Proposition 12.51 below. The way the filtration of U out of Z is done is precisely the key 
difference between our two algorithms for nonlinear matroid optimization in §3 and §4. 

2.3 Limitations 

Here we provide various intractability statements about the nonlinear matroid optimization problem 
and some of its relatives. In particular, we show that, if the weights are encoded in binary, then even 
the 1-dimensional problem, namely with fixed d = 1, over any matroid given explicitly or by an oracle, 
requires to examine the objective value of every base, and hence cannot be solved in polynomial time. 

It is convenient to define a certain class of rank k matroids on 2k elements, so as to generalize the 
uniform matroid Uk^2k and the partition matroids Z^i 2- For some r, 1 < r < k, partition K = {1, . . . ,k} 
into r parts, as ^l^-^Ki. Correspondingly, let Ki := {j : j £ Ki}. Let ki = \Ki\ = \Ki\. For i = 1, . . . , r, 
let Mi be a uniform matroid of rank ki on ground set KiUKi (so ^Ai = Uki,2ki)- Let M.r,k ■= ©i=i-A^j • 
Observe that Mi,k — l^k,2k (which is uniform) and M-k,k — ^12 (which is graphic), so hardness results 
with respect to the matroid classes Air.k cipply to uniform and graphic matroids. We note and will soon 
use that for any r, 1 < r < fc, \B{Air,k)\ is not bounded by any polynomial in \£{M.j.,k)\ = 2fc. 

Proposition 2.3 Computing the optimal objective value of the 1-dimensional nonlinear matroid op- 
timization problem over matroids M , given explicitly or by an oracle, on n-element ground sets, and 
a univariate function f presented by a comparison oracle, cannot be done in polynomial time. In 
particular, with the single weight vector w := (1, 2, 4, ... , 2"~^), solution of the nonlinear matroid op- 
timization problem requires examining f(W{B)) for each of the \B{M)\ bases of M . In particular, for 
each r, 1 < r < k, the problem cannot be solved in polynomial time for the class of matroids M.r,k ■ 
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Proof. The weights w{S) = X^jg^S-'"-^ of the 2" subsets S N attain precisely all 2" distinct 
values 0, 1, . . . , 2" — 1. Since the function / is arbitrary, this implies that the objective value f(W{B)) 
of each base B can be arbitrary. Therefore, if the value f{W{B)) of some base B is not examined, it 
may be that this value is the unique minimum one and the nonlinear matroid optimization problem 
cannot be correctly solved. The final remark of the proposition follows since, for every r, is 
not bounded by any polynomial in \£{M.r,k)\ = 2fc. □ 

Proposition 2.4 Computing the optimal objective value of the nonlinear matroid optimization prob- 
lem in variable dimension d = n, over any matroid M, with {0, l}-wa/ued weights, the i-th weight 
being the standard unit vector Wi := li in M" for all i, and with f : — > M a function presented by a 
comparison oracle, requires examining f{W{B)) for each of the \B{M)\ bases of M. In particular, for 
each r, 1 < r < k, the problem cannot be solved in polynomial time for the class of matroids A4r,k ■ 

Proof. The 1^-profiles W{B) = {wi{B), . . . ,Wn{B)) of the 2" subsets B (1 N attain precisely all 
2" distinct vectors in {0, 1}". Since the function / is arbitrary, this implies that the objective value 
f{W{B)) of each base can be arbitrary. The rest of the argument is as in the proof of Proposition l2.3[ □ 

Consider nonlinear matroid optimization with a matroid M, weights ifi, . . . , it;^ € Z", and function 
/ : M"* ^ M. As explained in ^2.21 a crucial component in solving the problem is to identify VF-profiles 
of bases; that is, points u G Z'^ satisfying u = W{B) for some B E B{M). The following proposition 
shows that, with binary encoded weights, both the nonlinear matroid optimization problem and the 
problem of deciding if a given u is a VF-profile of some base, are intractable already for fixed d = 1 
and uniform matroids or partition matroids that are the direct sums of rank-1 uniform matroids. 

Proposition 2.5 Given matroid M, a single nonnegative weight vector w € N", and nonnegative 
integer w G N, encoded in binary, the following problems are NP-complete, already when restricted to 
the class of matroids M.r,k, for any r, 1 < r < k: 

1. Determining whether u = W{B) = ^jg^ Wj for some base B G B{M). 

2. Determining whether the optimal objective value is zero for the 1-dimensional nonlinear matroid 
optimization problem over M, with the explicit convex univariate function f(y) := {y — u)'^ . 

Proof. The NP-complete subset-sum problem is to decide, given ao,ai, . . . ,ak G N, whether there 
is a subset S C K = {l,...,k} with J^jes'^j ~ '^o- Given such Oj, let u := oq, and let w := 
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(ai, . . . , Ofc, 0, . . . , 0) G X N^. Then, for any r, a base B of Mr^k satisfies W{B) = li if and only if 
S := B (1 K satisfies J2jeS ~ ^o- This reduces the subset-sum problem to the problem considered 
in the first part of the proposition, showing that it is indeed NP-complete. 

For the second part, note that the objective value f{W{B)) = {W{B) — n)^ of every base B 
is nonnegative, and B has f{W{B)) = if and only if W{B) = u. Thus, the optimal objective 
value is zero if and only if there is a base with W{B) = u. So the problem in the first part of the 
proposition reduces to the problem in the second part, showing the latter to be NP-complete as well. □ 



3 Arbitrary Matroids 

In this section we develop a combinatorial algorithm for nonlinear matroid optimization, that runs in 
polynomial time for any matroid presented by an independence oracle, provided that the number p 
of distinct values taken by the entries wtj of the weight vectors is fixed. In particular, the algorithm 
applies to {0, l}-valued weight vectors as well as to {0, 1, . . . ,p}-valued weight vectors for any fixed p. 

As explained in we will filter the set U = {W{B) : B £ B{M)} of ly-profiles of bases out 
of a suitable superset Z. For this, we next show how to efficiently decide if a given n E Z'' satisfies 
u = W{B) for some B € B{M). We start with {0, l}-valued W^-profiles with pairwise disjoint supports. 

Lemma 3.1 There is an algorithm that, given matroid M presented by an independence oracle on 
the n-element ground set N, weight vectors wi, . . . ,Wd € {0, 1}*^ with pairwise disjoint supports, and 
u € N*^, determines if M has a base B with W -profile W{B) = u, in time polynomial in n and (n). 

Proof. For each base B oi M and for each i = 1, . . . , d we have Wi{B) = \B n supp{wi)\. Therefore, 
a base B has W{B) = u if and only if \B n supp(u;j)| = Uj for i = 1, . . . ,d. So we may and do assume 
X^f=i Ui < rank(M) and Ui < \supp{vui)\ for all i else M has no base with W{B) = u. Let 

B' ■.=i^B C N : \B n supp{wi)\ = Ui , i = l,...,d, 

Bn(^N\[jt,snpp{wi)) 

It is easy to see that B' = B{M') is the set of bases of a partition matroid M' on ground set A^, for 
which an independence oracle is efficiently realizable. Moreover, M has a base B with W{B) = u \i 



rank(M) - Ya=i 
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and only if B{M) D B{M') is nonempty. These observations justify the following algorithm: 



if Ym=i > rank(M) or Ui > \supp{wi)\ for some i = I, . . . ,d then return NO; 

Determine if B{M) n B{M') is nonempty by computing a max-cardinality S € I[M) nX(M'); 

if IS I = rank(M); 

then 

return YES and S := S] 

else 

return NO; 

end 



Computing a max-cardinality S G Z{M) r)I{M') can be efficiently carried out using a maximum- 
cardinality matroid-intersection algorithm, see e.g. [9] or [10] and references therein. □ 

Next we consider weight vectors for which the number p of distinct Wij values is fixed. So, we 
assume that wi, . . . ,Wd € {oi, . . . , Cp}" for arbitrary given integer numbers ai, . . . ,ap. Note that the 
Oj can vary and be very large, since they affect the running time through their binary length (oj). 

Lemma 3.2 For every fixed d and p, there is an algorithm that, given matroid M presented by an 
independence oracle on ground set N, integers oi, . . . , a^, weight vectors vui, . . . ,Wd G {ai, . . . , Cp}", 
and u G N'^, decides if M has a base B with W{B) = u, in time polynomial in n, max(aj), and (u). 

Proof. Let V := {oi, . . . , Op}'^ and let P be the d x p'^ pattern matrix having columns that are all 
of the p'^ points in V. Let, as usual, W be the d x n matrix with rows vui, . . . , Wd- For j = 1, . . . , n, let 
denote the j-th column of W. We will exploit the fact that, no matter how large n is, the columns 
of W all lie in the fixed set ^ - so the number of distinct columns of W is limited. 

Define a p^ x n selector matrix W, having rows indexed by V, columns (v^ indexed by A^, and 

^ \ 1, if = V ; 

vjyj := < 

I 0, otherwise, 

for V € V , j N . Note that each column & of is a standard unit vector, selecting the unique 
pattern (i.e., column) of V that agrees with the column of W. It should be clear that the rows of 
W, namely the w^, lie in {0, 1}" and have pairwise disjoint supports - this will enable us to appeal to 
Lemma |3. 11 We observe and will make use of the fact that the weight matrix W factors as W = PW. 
Therefore, the PV^-profile and PF-profile of a base B satisfy W{B) = PW{B) . This implies that there 
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exists a base B G B{M) with W{B) = u \i and only if for some p'^-dimensional vector u satisfying 
u = Pu there exists a base B e B{M) satisfying W{B) = u. Since W is {0, l}-valued, any such 
vector u = W{B) must lie in {0, . . . , mj^'', where m := rank(M). Therefore, checking if there is a base 
B G B{M) with W{B) = u, reduces to going over all vectors u G {0, . . . , m}^'^ , and for each, checking 
if Pu = u and if there is a base B satisfying W{B) = u. This justifies the following algorithm. 

let P be the d x p'^ pattern matrix, and let W be the selector matrix (both determined by W); 
for 2 G {0, 1, ... , m}P'^ do 

if Pu = u then 

if there is a base B G B{M) with W{B) = u then return B; 

end 
end 

return NO; 

Since d and p are fixed and m < n, the number (m + 1)'^'* of such potential vectors u is polyno- 
mial in the data. For each such vector u, checking if Pu = u is easily done by direct multiplication; 
and checking if there exists a base B G B{M) satisfying W{B) = u can be done in polynomial time 
using the algorithm of Lemma l3.ll applied to the matroid M, the d := p'^ weight vectors w^, v £ V 
(the {0, l}-valued rows of the matrix W, having pairwise disjoint supports), and the vector u. □ 

We are now in position to solve the nonlinear optimization problem over a matroid that is presented 
by an independence oracle. 

Theorem 11.11 For every fixed d and p, there is an algorithm that, given a matroid M presented 
by an independence oracle on the n-element ground set N, integers ai,... ,ap G Z, weight vectors 
wi^. . . ,Wd G {oi, . . . ,ap}"', and function f : ^ presented by a comparison oracle, solves the 
nonlinear matroid optimization problem in time that is polynomial in n and max(aj). 

Proof. We have three major steps: 

1. First, under the hypotheses of the theorem, assume that in polynomial time we can calculate 
the optimal objective value of the problem. Then, for every subset S N, the optimal objective 
value of the subproblem on S can be computed in polynomial time, since the entries of the 
restrictions of wi,...,Wd to S also attain values in {ai,...,ap}. By Lemma [2.11 this implies 
that an optimal base can be found and the nonlinear matroid optimization problem solved in 
polynomial time. So, it remains to show that in polynomial time we can calculate the optimal 
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objective value of the problem. 



2. To accomplish this, we first show how to compute the set U of VF-profiles of bases of M, 
by working with an appropriately defined superset Z of U. Let m := rank(M). Consider any 
i = 1, . . . , d and any m-subset B of N. Then, since Wij € {oi, . . . , Op} for all j, we have that, 
for some nonnegative integer vector Aj = (Aj,i, . . . , Ai,p) with Yl^=i ^i,k = 



where a := (oi, . . . , Op). Therefore, we find that set the set U of VF-profiles of bases satisfies 

U = {W{B) : B G B{M)] 

C {W(B) : B QN, \B\ = m} 

C Z := {{Xi-a,...,Xd-a) : Aj G {0, 1, . . . , m}^ , A, • 1 = m , ^ = 1,...,^} . 

These observations justify the following algorithm to compute the set U of 1^-profiles of bases: 

Compute m := rank(M) and let a := (oi, . . . , Op); 

Start with Z := 0; 

for A G {0,1,... ,myP do 

if Al = ml then let Z := Z U {Aa}; 

end 

Start with U := 0; 
for u G Z do 

if there is a base B € B{M) with W{B) = u then let U := U U {u}; 



Observe that, since d and p are fixed, \Z\ < |{0, 1,... ,m}'^^*'| = (m + 1)^*^ is polynomially 
bounded in m < n and hence so are the numbers of iterations in each of the "for" loops of the 
algorithm. Also note that, in each iteration of the second loop, we can apply the algorithm of 
Lemma [3.2l to determine, in polynomial time, whether there is a base B G B{M) with W(B) = u. 
Therefore, we can efficiently determine U. 

3. By repeatedly querying the comparison oracle of / on \U\ — 1 suitable pairs of points in U, we 
obtain /* := mm{f{u) : u G U}, which by Proposition 12.21 is the optimal objective value. 



p 




k=l 



end 



return U; 
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Finally, by embedding steps 2-3 above as subroutines to solve the re + 1 restrictions of the problem 
to suitable subsets 5 C in step 1, we obtain our desired algorithm. □ 



4 Vectorial Matroids 

In this section we develop an algebraic algorithm for nonlinear matroid optimization over vectorial 
matroids. It runs in time polynomial in the binary length (A) of the matrix A representing the matroid 
and in the unary length max|it;jj | of the weights. It is much more efficient than the combinatorial 
algorithm of §3 and applies to weights with an unlimited number of different values Wij of entries. 

First, we show that it suffices to deal with nonnegative weight vectors. 

Lemma 4.1 The nonlinear matroid optimization problem with arbitrary integer weight vectors 
wi ... ,Wd G '^'^ is polynomial-time reducible to the special case of nonnegative vectors wi . . . ,Wd £ N". 

Proof. Consider a matroid M, integer weig hts wi,...,Wd G Z", and function f : ^ R. Let 
m := rank(M) and q := max Define nonnegative weights by vu'^ j := vuij + q for all i,j, and define 

a new function /' : M"' ^ M by f'{yi, ...,yd):= f{yi-mq, . . .,yd-mq) for every y = (yi, • • • , yd) G M"*. 
Note that the unary length of each new weight w'^ j is at most twice the maximum unary length of the 
original weights Wij, and a comparison oracle for /' is easily realizable from a comparison oracle for 
/. Then for every base B and for each i = 1, . . . ,d, we have 

w'i{B) = ^Wij = "^{wij + q) = {"^Wij] +mq = Wi{B) + mq , 

implying the following equality between the new and original objective function values, 

nw[{B),...,w'd{B)) = f'{wi{B)+mq,...,Wd{B)+mq) = f{wi{B),...,Wd{B)). 

Therefore, a base B G B{M) is optimal for the nonlinear matroid optimization problem with data M, 
wi,...,Wd and / if and only if it is optimal for the problem with M, w'l, . . . ,w'^ and /'. □ 

So we assume henceforth that the weights are nonnegative. As explained in §2.21 and carried out 
in §3, we will filter the set U = {W{B) : B G B{M)} of Vl^-profiles of bases out of a suitable superset 
Z. However, instead of checking if u G C/ for one point u ^ Z after the other, we will filter here the 
entire set U out of Z at once. We proceed to describe this procedure. 
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Let ^ be a (nonempty) m x n integer matrix A of full row rank m, and let M be the vectorial 
matroid of A. Note that m = rank(M). Let wi, . . . G N" be nonnegative integer weight vectors and 
let q := maxwij. Then for each i = 1, . . . ,d and each m-subset B of N, we have Wi{B) £ {0, 1, . . . , mq}, 
and therefore 

U = {W{B) : B G 13{M)} C {W{B) : B C N, \B\ = m} 
C Z := {0,1,..., mqy C N'^ . 

We will show how to filter the set U out of the above superset Z of potential VF-profiles of bases. 
For each base B € B{A4), let A^ denote the nonsingular m x m submatrix of A consisting of those 
columns indexed hj B <^ N. Define the following polynomial in d variables yi, . . . , y^: 

d 

g = g{y) := ^ ^.y" := E 11 > (2) 

uez uez k=i 

where the coefficient gu corresponding to n € Z is the nonnegative integer 

gu := J^{det2(A^) : i3GS(M), W{B) = u} . (3) 

Now, det^(A'^) is positive for every base B G B{M). Thus, the coefficient gu corresponding to u £ Z 
is nonzero if and only if there exists a matroid base B G B{M) with W{B) = u. So the desired set 
U is precisely the set of exponents of monomials y" with nonzero coefficient in g. We record this for 
later use: 

Proposition 4.2 Let M he the vectorial matroid of anmxn matrix A of rank m, let wi, . . . ,Wd G N"', 
and let g{y) he the polynomial in Then U := {W{B) : B G B{M)} = {u e Z : guT^O}- 

By Proposition 1121 to compute U, it suffices to compute all coefficients gu- Unfortunately, they cannot 
be computed directly from the definition ([3]), since this involves again checking exponentially many 
B G B{M) - precisely what we are trying to avoid! Instead, we will compute the gu by interpolation. 
However, in order to do so, we need a way of evaluating g{y) under numerical substitutions. We 
proceed to show how this can be efficiently accomplished. 

Let Y be the nxn diagonal matrix whose j-ih. diagonal component is the monomial Y]^^i y^''^ in 
the variables yi, . . . , y^; that is, the matrix of monomials defined by 

Y := diagfnyr'\...,nyr 

\i=l i=l 

The following lemma will enable us to compute the value of g{y) under numerical substitutions. 
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Lemma 4.3 For any m x n matrix A of rank m and nonnegative weights wi, 



, tt;^ G we have 



giy) = det{AYA'^) . 

Proof. By the classical Binet-Caucliy identity, for any two mx n matrices C, D of rank m we have 
det{CD^) = E{det(C^) det{D^) : B G B{M)]. Applying this to C := AY and D := A, we obtain 

d 

det{AYA^) = det((^y)^det(^^)) = JJ JJ '■Met(^^) det(A^) 

BeB{M) BeB{M) jeBi=l 

d d 



;B(A/) i=l 

^guy'' = g{y) ■ 



B&B(M)i=l ueZ BeB{M): i=l 

W(B)=u 



u£Z 



Lemma 143) paves the way for computing the coefficients of the polynomial g{y) = ^^^z d^y^ by 
interpolation. We will choose sufficiently many suitable points on the moment curve in M'^, substitute 
each point into y, and evaluate g{y) using the lemma. We will then solve the system of linear equations 
for the coefficients gu- The next lemma describes the details and shows that this can be done efficiently. 



Lemma 4.4 For every fixed d, there is an algorithm that, given any m x n matrix A of rank m and 
weights wi, . . . ,Wd G N", computes all coefficients gu of g{y) in time polynomial in maxwij and (A). 

Proof Let q := maxtf;,^- and p := mq + 1. Then a superset of potential VF-profiles of bases is 
Z := {0, 1, . . . ,mq}'^ and satisfies \Z\ = p"^. For t = 1,2, .. . ,p'^, let Y{t) be the numerical matrix 
obtained from Y by substituting t^' ^ for yi, i = 1, . . . ,d. By Lemma 14.31 we have g{y) = det{AYA'^), 
and therefore we obtain the following system of p'^ linear equations in the p"^ variables gu, « G Z: 

det{AY{t)A^) = det ( AdiagjiHt""-^ P'~'\ aA = X]^"!!*"'^"' 

V \i=i / / uez 1=1 

uez 

As u runs through Z, the sum X]f=i '^iP^~^ attains precisely all \Z\ = p'^ distinct values 0, 1, . . . — 1. 
This implies that, under the total order of the points li in Z by increasing value of Yli=i UiP^~^j the 
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vector of coefficients of the gu in the equation corresponding to t is precisely the point {t^, t^, . . . 

on the moment curve in M.^ = Mp'*. Therefore, the equations are hnearly independent and hence the 

system can be uniquely solved for the gu- 



These observations justify the following algorithm to compute the gu, u £ Z : 



Compute m := rank(A); 




let q := maxwij , and let p:=mq+l ; 




iety:=diag,(ntiyr") ; 




for t = 1,2, . . . ,p'^ do 




let Y{t) be the numerical matrix obtained by substituting for yj, i = 1,2, . . 


. ,d, in Y; 


Compute detiAYit)^^); 




end 




Compute and return the unique solution gu, u £ Z , of the square linear system: 




det{AY{t)A'^) = , u £ Z ; 





We now show that this system can be solved in polynomial time. First, the number of equations 
and indeterminates is p"^ = (mq + l)'^ and hence polynomial in the data. Second, for each i,j = 1, . . . , n 
and t = 1,2, . . . ,p'^, it is easy to see that the («,j)-th entry of AY{t)A'^ satisfies 

n d n 

h=l k=l h=l 

implying that the binary length {AY{t)A'^) of AY{t)A'^ is polynomially bounded in the data as well. 

It follows that det{AY{t)A^) can be computed in polynomial time by Gaussian elimination for all 
t, and the system of equations can indeed be solved for the gu in polynomial time. We further note 
that the system of equations is a Vandermonde system, so the number of arithmetic operations needed 
to solve it is just quadratic in its dimensions. □ 

We can now efficiently solve the nonlinear optimization problem over vectorial matroids with unary 
weights. 

Theorem [TT3] For every fixed d, there is an algorithm that, given integer mxn matrix A, weight vec- 
tors wi, . . . ,Wd € Z", and function / : M*^ — > M presented by a comparison oracle, solves the nonlinear 
optimization problem over the vectorial matroid of A in time polynomial in (A) and max |tCij|. 

Proof. Let M be the vectorial matroid of A. Recall that linear- algebraic operations on A can 
be done in polynomial time, say by Gaussian elimination. Dropping some rows of A if necessary 
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without changing M, we may assume that A has rank m. An independence oracle for M is readily 
realizable since S N is independent in M precisely when the columns of A indexed by S are linearly 
independent. Applying, if necessary, the procedure of Lemma |4. II and adjusting the weights while at 
most doubling the unary length of the maximum weight, we may also assume that the weights are 
nonnegative. 

We will show how to compute the optimal objective value /* := mm{f{W{B)) : B € B{M)} in 
polynomial time. This will also imply that, for every subset S Q N, the optimal objective value of the 
subproblem on S can be computed in polynomial time. By Lemma l2.1l this will show that an optimal 
base can be found and the nonlinear matroid optimization problem solved in polynomial time. 

Let q := m.ax.Wij and consider the superset Z := {0, 1, . . . ,mg}'^ of potential VF-profiles of bases 
and the polynomial g{y) = J2uez d-^U^ defined in ^ and ([3]). By Proposition 14.21 we have 

U = {W{B) : BgB{M)} = {uGZ : Qu^O} . 

Applying now the algorithm of Lemma 14.41 we can compute in polynomial time the right-hand side 
and hence the left-hand side, providing the filtration of the set U of PF-profiles of bases out of 
Z. By repeatedly querying the comparison oracle of / on suitable pairs of points in U, we obtain 
/* := min{/(u) : u G U} which by Proposition 12.21 is the desired optimal objective value. □ 



5 Experimental Design 

We now discuss applications of nonlinear matroid optimization to experimental design. For general 
information on experimental design see e.g. the monograph [15j and references therein. As outlined in 
the introduction, we consider the following rather general framework. We wish to learn an unknown 
system whose output y is an unknown function <I> of a multivariate input x = {xi, . . . , x^) S M'^. It is 
customary to call the input variables factors of the system. We perform several experiments. Each 
experiment i is determined by a point pi = . . . ,Pi,k) and consists of feeding the system with input 
X := Pi £ M.'' and measuring the corresponding output yi := G M. Based on these experiments, 

we wish to fit a model for the system, namely, determine an estimation ^> of that: 

• Lies in a prescribed class of functions; 

• Is consistent with the outcomes of the experiments; 

• Minimizes the aberration - a suitable criterion - among models in the class. 
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We concentrate on (multivariate) polynomial models defined as follows. Each nonnegative integer 
vector a S N'^ serves as an exponent of a corresponding monomial := 11^=1 ^h'^ ™ system input 
X € M'^. Each finite subset i3 C of exponents provides a model for the system, namely a polynomial 
supported on B, i.e. having monomials with exponents in B only, 

where the Cq, are real coefficients that need to be determined from the measurements by interpolation. 

We assume that the set of design points {pi, . . . ,pm} C M'^ is prescribed. Indeed, in practical 
applications, it may be impossible or too costly to conduct experiments involving arbitrarily chosen 
points. The problem of choosing the design (termed the inverse problem in the statistics literature, 
see [2] and references therein), is of interest in its own right, and its computational aspects will be 
considered elsewhere. We collect the design points in an m x /c design matrix P. Thus, the i-th. row of 
this matrix is the i-th design point pi. A model B C is identifiable by a design P if for any possible 
measurement values yi = ^{pi) at the design points, there is a unique polynomial <&b(x) supported on 
B that interpolates that is, satisfies ^siPi) = Hi = ^(Pi) for every design point pi = (pi,i, . . . ,Pi,k)- 

Among models identifiable by a given design, we wish to determine one that is best under a suitable 
criterion. Roughly speaking, common criteria ask for low degree polynomials. To make this precise, for 
each identifiable model B, consider the following average degree vector whose i-th. entry is the average 
degree of variable Xi over all monomials supported on B : 




Now, given any function / : M'^ — > M, the aberration of model B induced by / is defined to be 



A{B) : = 




The term aberration is the one used in the statistics literature in this context, see e.g. [5l [18] and the 
reference therein. We now give some concrete examples of functions providing useful aberrations. 

Example 5.1 Some concrete useful aberrations. 

• Consider the function f{x) := xi + • • • + x^. Then the aberration of model B is 

' ' aeB ' ' aS-B ' ' oG-B i=l 

which is the average total degree of monomials supported on B. 
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Consider f{x) := vrixi + • • • + iTkXk for some real weights vri, . . . , vrfc. Then 

is the weighted average degree, ahowing for preferences of some variables over others. 
Consider the function /(x) := maxjxi, . . . ,Xk}. Then the aberration of B is 



A{B) = maxi ^ai,...,-^ ^ afc I , 



and is the maximum over variables of the average variable degree of monomials supported on B. 

k - 

More generally, consider f{x) := ||7r • x\\q = (X]i=i Ki^^il*^) ' • Then the aberration of B is 



A{B) 



B\ ■ " 

' qG-B 



which is the /^-norm of the weighted average degree vector of monomials supported on B. 



We can now formally define the minimum-aberration model-fitting problem. 

Minimum- Aberration Model-Fitting Problem. Given a design P = {pi, . . . ,Pm} of m points in 
M'^, a set N = . . . , /?„} of n potential exponents in N'^, and a function / : ^ M, find a model 
B N that is identifiable by P and is of minimum aberration 

A{B) = 




It can be verified that a model B is identifiable by a design P if and only if i? = . . . , (3^} C N'^ 
for some m exponents Pj = . . . , /3j^k): j = 1, ■ ■ ■ ,m, and the m x m matrix T, defined by 



k 



is invertible. (In the terminology of algebraic geometry, the model B is identifiable if the congruence 
classes of the monomials x^^ , • • • , x^"" form a basis for the quotient of the algebra of polynomials 
M[xi, . . . , Xk] modulo the ideal of polynomials vanishing on the design points; see [3l[T3] and references 
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therein for more on this.) If B is identifiable then, given any vector of measurements y G M™" at the 
design points, the vector of coefficients c G M"^ of the unique polynomial ^b{x) = X^jLi 11^=1 ^h' '^ 
supported on B that is consistent with the measurements, is given by c := T^^y. 

It should be now quite clear how to formulate the minimum-aberration model-fitting problem over 
a design P = {pi, . . . ,Pm}, a set N = . . . , /3„} of potential exponents, and a function / : M'^ ^ M, 
as a nonlinear matroid optimization problem. Consider the following m x n matrix A having rows 
indexed by P and columns indexed by N, defined by 

k 

«ij ■= Pi = [[Pi'h ' ^ = 1, . . . ,m , J = 1, . . . ,n . 

h=l 

Then a model B C N is identifiable by P if and only if the submatrix of A comprising columns indexed 
by B is invertible. If the rank of A is less than m, then no i? C is identifiable, and the set N of 
potential exponents should be augmented with more exponents. So assume that A has rank m. Let 
M be the vectorial matroid of A, so that 

B{M) := {B QN : B is identifiable by P} . 

Now define k weights vectors . . . , G N" by Wij := ^Pj^i fox i = 1, . . . ,k, j = 1, . . . ,n. Then the 
aberration of model B is 

= /f^E^O = M E^l.^•'•••'E^'^.^■ 
Thus, the aberration of a model B identifiable by the design P is precisely the objective function value 
of the base B in the nonlinear matroid optimization problem over the matroid M above, with d := k 
and the weights wi . . . ,Wk G N" as above, and with the given function / : R. Assuming that 

a comparison oracle for the function / can be realized, which is practically always true, and that the 
design points are rational so that they can be input and processed on a digital computer, we obtain 
the following corollary of Theorems 11.11 and II. 3t 

Corollary 11.51 For every fixed k, there is an algorithm that, given a rational design P = {pi, . . . ,Pm} 
in M'^, a set N = . . . ,/?«} in N'^, and a function / : M'^ ^ M presented by a comparison oracle, 
solves the minimum-aberration model-fitting problem in time polynomial in m,n, (P) and max l3ij. 

It is very natural and common in practice to consider hierarchical models; that is, models B with 
the property that P < a £ B implies (3 £ B. In [3l [13], in the context of the theory of Grobner bases 
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in commutative algebra, it was shown that the smallest set containing all m-point hierarchical models 
B (termed staircases therein) in is the following set, consisting of roughly 0(m log m) points, 

k 

N := {aen'' : JJ(aft + l)<m}. 

h=l 

Thus, Corollary 1 1 . 5 1 will be typically applied with this set N as the set of potential monomial exponents. 

We proceed to describe a more general useful class of aberrations that can be treated, which is 
naturally suggested by the nonlinear matroid optimization formulation. As before, we are given a 
design P = {pi, . . . ,Pm} in and a set = {/3i, . . . , of potential exponents in N*^. But now we 
are also given d weight vectors . . . , it;^ G Z". The function / is now defined on rather than M.^. 
The aberration induced by the weights and the function is now simply the objective function of the 
nonlinear matroid optimization problem, which for an identifiable model B Q N is given by 

MB) := fiWm , 

where W is the matrix with rows Wi. Note that aberrations of the type considered before can be 
recovered as a special case with d := k and Wij := for all Here are a few useful examples. 

Example 5.2 Some concrete useful generalized aberrations. Let N = . . . , /?„} C N'^ be 

any set of exponents. Let ^ be a small positive integer, say ^ = 1 or ^ = 2, that will serve as a desired 
bound on the degrees of variables in monomials of the sought after models B contained in N. 

• Let d := 1, and define the single weight vector wi S N" by 

{0, 0ii < 9 , for every i = 1, . . . ,k : 
1, otherwise, 

for J = 1, . . . , n. Then X]/3jGB ^ij the number of monomials supported on B Q N that do not 
meet the degree bound; in particular, for = 1 it is the number of non square-free monomials. 
Taking / : M — > M to be the identity /(y) := y, the aberration A{B) of model B is the number of 
undesired monomials. In particular, an optimal model B has A{B) = if and only if the design 
admits an identifiable model with all variables in all monomials having degree at most 9. 

• Now let d := k, and define weight vectors wi, . . . ,Wk G N" by 

/ 0, if Aj- < 9 ; 
I 1, otherwise, 
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for z = l,...,k, j = l,...,n. Then Ylpj^B'^hj number of monomials supported on 

model B C N for which variable Xi violates the degree bound 9. Defining / : M'^ ^ M by 
f{y) := max^^j^ yi, we get that the aberration A{B) of model B is the maximum over variables 
of the number of monomials having Xi violating the degree bound 6. The optimal model will 
minimize the maximum violation. 

We have the following generalized minimum-aberration model-fitting problem and corollary. 

Generalized Minimal- Aberration Model-Fitting Problem. Given a design P = {pi, ■ ■ ■ ,Pm} 
in M*^, a set = . . . ,/?„} of potential exponents in N*^, weight vectors wi, . . . ,Wd G Z", and a 
function f : M!^ M., find a model B C. N that is identifiable by P and is of minimum aberration 

A{B) = fiW{B)) = / E E 

Corollary 5.3 For every fixed k and d, there is an algorithm that, given any rational design 
P = {pi, ■ ■ ■ ,Pm} 'in any set N = . . . , /3„} of potential exponents in N'^, weight vectors 
VJi, . . . ,Wd G Z", and function f : ^ R. presented by a comparison oracle, solves the generalized 
minimum- aberration model-fitting problem in time polynomial in m,n, (P), max/3j^i and max\wij\. 
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